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^ ' Abstract 
CN ; 

Negative correlations in the distribution of prime numbers are found to display a scale 
""q^I invariance. This occurs in conjunction with a nonstationary behavior. We compare the 



prime number series to a type of fractional Brownian motion which incorporates both 
j5 I the scale invariance and the nonstationary behavior. Interesting discrepancies remain. 

The scale invariance also appears to imply the Riemann hypothesis and we study the 
use of the former as a test of the latter. 
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The distribution of prime numbers has been a source of fascination for mathematicians for 
centuries. Of interest to physicists is the fact that the distribution of prime numbers exhibits 



m 
o 

O ■ certain characteristics that are usually associated with chaotic and complex systems. The 
data on the prime number series is plentiful and a surprisingly rich structure is uncovered 
^ I when this data is probed in various ways The closely associated zeros of the Riemann zeta 
5^ I function have been studied even more intensely from a physics viewpoint [2]. 

We shall show that prime number differences display a scale invariance in their correlations. 
The differences also display a nonstationary behavior that coexists with the scale invariance. 
We shall identify a random process, fractional Brownian motion of a lesser known type, that 
displays these and other properties of the prime number series. But there are still quantitative 
differences which suggest that a further generalization of fractional Brownian motion remains 
to be developed. The resolution of this puzzle could have practical value for descriptions of 
complex systems in physics and economics. In addition we shall describe how a breakdown of 
the Riemann hypothesis would not be compatible with the preservation of scale invariance. 
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This provides some further insight and an additional test of the Riemann hypothesis in terms 
of concepts famihar to physicists. 
Consider the difference 

Li(a;) -7r(a;). (1) 

Li(x) is the logarithmic integral function, the principal value of 1/ \n{t)dt, and 7i{x) is 
the prime counting function, the number of primes less than or equal to x. The Riemann 
hypothesis is equivalent an upper bound on the absolute value of this difference, with a 
particular example being |Li(x) — 7r(a;)| < ^/x ln{x) / {8tt) for x > 2657 [3J. We extract from 
([1]) the discrete series 

bi^ = U{p,) - t, (2) 

where the ith prime number is Pi. The positive increments Li(pj+i) — Li(pj) have a mean close 
to unity and a distribution close to exponential for sufficiently large sample size. Thus to first 
approximation b{i) executes something close to a random walk as it "diffuses" away from zero 
for increasing i. But it is certainly not a true random walk [4j. The Riemann hypothesis is 
a nontrivial constraint on how far \b{i)\ can depart from zero, while at the same time b{i) 
remains positive to some extremely large value of i. 

A study of correlations among the differences, b{i + j) — b{i), will help to characterize 
more precisely how b{i) differs from a true random walk. For convenience the correlations 
we consider will be constructed from the sequences {6(1 ■ 2"), 6(2 ■ 2"), 6(3 ■ 2"), 6(2"^ ■ 2")} 
determined by integers m > 3 and n > 0. From these we obtain the sequences of the 2'" — 1 
successive differences, 

A(m, n) = {6(2 ■ 2") - 6(1 ■ 2"), 6(3 ■ 2") - 6(2 ■ 2"), ...}. (3) 

We wish to investigate the correlations among the entries of A(m, n). For increasing n these 
sequences will probe correlations on larger and larger scales, and n can be thought to label a 
scale transformation from the nearest neighbor differences (n = 0) to larger differences. For 
each A(m, n) consider the two sequences x and y formed by dropping the first k and last k 
elements of A(m, n) respectively. Then calculate the Pearson correlation 

,^f-'^':f>fa7' (4) 

In the language of discrete time series, Ck{m, n) is an autocorrelation of lag k calculated from 
a sample of 2"* primes. 

We highlight the sequences for m from 10 to 16 with n ranging from up to 37 — m. We 
don't need all primes up to the 2'^^th prime, which is about 3.8 x 10^^, but only those to fill 
out the A(m, n) sequences. The results for the correlations Ci{m,n) and C2(m, n) are shown 
in Fig. (1) and Fig. (2) respectively. Note that we have added a 0.1 vertical spacing between 
the points of different m. 
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Figure 1: Ci{m,n) — (16 — m)/10 for m = 16, 15, 14, 13, 12, 11, 10 from top to bottom. 
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Figure 2: C2{m,n) — (16 — m)/10 for m = 16, 15, 14, 13, 12, 11, 10 from top to bottom. 



We perform a least-squares linear fit to the data for each m and obtain the 95% confidence 
interval for the slopes, as listed in the following Table. 
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m 


Slope(Ci) 


Ci(m) 


Slope(C2) 


C2(m) 


16 




0.0011, -0.00015] 


-0.078 ± 0.0079 




-0.00026, 0.00019] 


-0.030 ± 0.003 


15 




0.0012, -0.00012] 


-0.085 ± 0.0094 




-0.00036, 0.00037] 


-0.033 ± 0.005 


14 




-0.0011, 0.00005] 


-0.094 ±0.010 




-0.00025, 0.00078] 


-0.036 ± 0.008 


13 




-0.0010, 0.0005] 


-0.104 ±0.013 




-0.00039, 0.00094] 


-0.040 ±0.012 


12 




-0.0013, 0.0004] 


-0.116 ±0.015 




-0.00032, 0.0014] 


-0.045 ± 0.016 


11 




-0.0011, 0.0010] 


-0.133 ±0.020 




-0.0007, 0.0011] 


-0.046 ±0.018 


10 




-0.0020, 0.0004] 


-0.152 ±0.024 




-0.0011, 0.0013] 


-0.048 ± 0.025 


9 




-0.002, 0.001] 


-0.17 ±0.04 




-0.001, 0.002] 


-0.06 ±0.04 


8 




-0.004, 0.001] 


-0.20 ±0.06 




-0.002, 0.003] 


-0.06 ±0.07 


7 




-0.003, 0.002] 


-0.24 ±0.07 




-0.002, 0.003] 


-0.10 ±0.08 


6 




-0.03, 0.003] 


-0.31 ± 0.09 




-0.006, 0.003] 


-0.09 ±0.13 


5 




-0.006, 0.003] 


-0.37 ±0.15 




-0.0007, 0.0011] 


-0.09 ±0.22 


4 




-0.006, 0.008] 


-0.45 ± 0.20 




-0.0011, 0.0013] 


-0.09 ±0.35 


3 




-0.02, 0.004] 


-0.49 ±0.27 




-0.002, 0.01] 


0.05 ± 0.45 



For completeness we have extended the Table to smaller values of m; for m from 3 to 9, n 
ranges from to 40 — m. All slopes are consistent with zero. Therefore we define Ckini) as 
the mean of the Ck{'m,n)^s for each m. Note that the quoted errors on the Cfc(m)'s are the 
standard deviations (f« -^1^) of the Cfe(m,n)'s about their means; the standard deviations of 
the Ci.(m)'s themselves are times smaller. 

^ ' V38— m 

The first point is that these correlations, which would be consistent with zero for a true 
random walk, are instead nonvanishing and negative. And although the \Ck{m) \ decrease for 
increasing m, the significance of their departure from zero increases. Negative correlations 
indicate a tendency for mean reversion. This phenomenon is clearly connected with the 
Riemann hypothesis, since it indicates that the near random movements have less tendency 
to drive b{i) from zero than would otherwise be the case for a true random walk. The second 
point is that the correlations for each m fluctuate around a mean value as the scale parameter 
n is changed. The correlations thus display a (statistical) self-similarity or scale invariance. 

In Figs. (3) and (4) we display how the variance and kurtosis of the distribution of differ- 
ences depend on the scaling parameter n. The variance for a true random walk would grow as 
(7^2" where is the variance of the nearest neighbor differences, and this corresponds to the 
horizontal line in Fig. (3). As expected the observed variances grow as a smaller power. From 
the behavior of the kurtosis we see that the distribution of differences quickly approaches the 
Gaussian form for increasing n. As mentioned above, the distribution of the nearest neighbor 
differences {n — 0) has the form of a shifted exponential. At small n one can see that there 
are also corresponding small anomalies in the correlations and variances. 
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Figure 3: Variance [A (m, n)]/2" for m = 16, 15, ...10 from top to bottom. The lines are 
examples of a'^2^'^^~^^^. 
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Figure 4: Kurtosis[A(m, n)], defined such that the Gaussian value is 3. The higher m values 
approach this line more closely. 
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2 Fractional Brownian motion 



Fractional Brownian motion (fBm) [5J is a random Gaussian process Bnit) with expectations 
E[BHit)] = and E[BHitf] = a^ltl"^". The Hurst exponent is < if < 1. Let us consider 
a discrete series of values Buii) for integers i. If we define the differences AH{i,j) = Buii + 
j) — Bh{i) for nonvanishing integers j then the correlations for lag k are 

C„(k,j,r) = ^tMMM±ij^. (5) 



The scale invariance (self-similarity) of fBm corresponds to Cni^k, Xj, \i) = Cuik,],!). This 
is the hallmark of fBm, and it is the desired property to mirror the scale invariance (n inde- 
pendence) in the prime number correlations. 

If Bnii) is defined to have stationary differences then the correlations are independent of 
i and are uniquely determined [5], 

ch{k,j) = ^-^ — r' ^' ■ (6) 



2|J| 



To obtain the desired negative correlations we must choose < H < 1/2. Then all Cjj{k, 1) = 
Cjj{—k, 1) < except Cjj{0, 1) = 1, and we also note the sum rule YlT=-oo^Hik, 1) = 0. 
Cjj{l, 1) and Cjj{2, 1) are displayed in Fig. (5) below. Mean reversion is characterized by 
the fact that if given only two values Bnih) and BH{i2), the expectation for Bh{oo) is 
(Sj^(zi) + -B/f(i2))/2 for any < H < 1/2. Ordinary Brownian motion has H = 1/2 and its 
expectation for Bh{oo) is i?// (max(zi, 22))- 

The fBm process as defined above misses an essential feature. The correlations ([6]) are 
independent of i and so they do not capture the observed m dependence, Ck{m), of the prime 
number correlations. For example by comparing the Ci(m)'s in the Table above to C\j{l, 1), 
the implied values of H are (0.441, 0.436, 0.428, 0.421, 0.411, 0.398, 0.381) for m = 16, 15, ...10. 
This m dependence represents a nonstationary behavior of the increments of the prime number 
series. There is also m dependence in the variances in Fig. (3), and for m = 16 the implied 
value of ii is ~ 0.475. This extraction of H from the scale dependence of the variances is 
similar in spirit to a rescaled range analysis. The latter was applied ^ to the distribution 
of large primes (the 25th million and 50th million primes) where H ^ 0.46 was obtained. 
But such an analysis does not in itself disentangle the scale invariance from the nonstationary 
behavior. As with the variances there is also a strong sensitivity, for example, to a simple re- 
weighting b{i) — > w{i)b{i). The correlations display much more robustly the scale invariance 
and the nonstationary behavior. 

The nonstationarity (a breaking of translation invariance) is perhaps not surprising given 
that the prime number series has a beginning. In fact there is another type of fractional 
Brownian motion that better reflects these properties. It was actually considered first [H [H [5] 
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due to its very simple definition in terms of a stochastic integral, 

Bnit) = Vm f it- uf-^/'^dB{u), t > 0, (7) 

where B{u) represents ordinary Brownian motion. With this definition Bnit) has a beginning 
as well as nonstationary increments. This has been referred to as Type 11 fBm in |9], where 
a comparison with the better known Type 1 fBm is made. An early study [8J of the former 
was concerned with H = 1, but the covariance obtained there can be generalized to other H 
by use of a hypergeometric function, 

E[B^{t)B^{s)] = J^f^i''^'^'-^''-'^'2F,il, l-H,l + Hi), s>t. (8) 
When {s — t)/t is small this approaches the corresponding covariance for Type 1 fBm, which 

is |s-tp^)/2. 

The correlations as defined in ([5]) can be obtained for type 11 fBm by using ([8]); they now 
have i dependence and they can be denoted by C]^{k, j, i). The scale invariance remains intact, 
C|J(A/c, Aj, \i) = C]^{k, In addition the correlations of type 1 fBm can be obtained as a 
limit, limj^oo j, = C\[{k,j). Away from this limit \C^{k, > \Cjj{k,j)\ with the 

largest difference at i = 0. In Fig. (5) we compare C^{k, 1,0) and C\[{k, 1) as a function of 
H ioik = 1,2. 
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Figure 5: C\j{k^ 1) (dashed blue) and C)J(A;, 1, 0) (solid red) as a function of H for A; = 1, 2. 

The correlations among prime numbers |Cfc(m)| decrease with increasing m, and this is 
nicely mirrored by the similar behavior of C^h{^i ^i with increasing i. (Note that our calcu- 
lation of Ck{m) would correspond to an average of C^{k, 1, i) over a range of i that grows with 
m.) Thus the properties of Type 11 fBm agree at least qualitatively with the prime number 
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series. We display in Fig. (6) the values of Ck{m) for a range of k and compare to the values 
spanned by C^{k, l,oo) and C]^{k, 1,0) for H = 0.4. The overall k dependence is in good 
qualitative agreement. But we see that the variability of C]^{k,l,i) with i is significantly 
less than the variability of Ck{m) with m. Thus while the increase in |C|}(fc, 1,0)1 relative to 
\Cjj{k, 1)1 goes in the right direction it does not go nearly far enough. 
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Figure 6: Ck{m) for m = 16, 15, ...10 from top to bottom. The upper and lower bounds on 
the yellow rectangles are Cj^{k, 1) = C^{k, 1, oo) and C]^{k, 1, 0) respectively for H = 0.4. 

We note that the implied H from the correlations increases with increasing m, while the 
implied H from the variances, if anything, decreases (from Fig. (3)). Thus for sufficiently large 
m these implied values of H may come together and perhaps the type II fBm process is more 
closely approached. But for the values of m we have studied it is clear that nonstationary 
behavior is more pronounced for the prime number b{i) series than it is for the type II fBm 
process. This raises the question of whether there is a generalization of a fBm process in 
which the nonstationarity can be adjusted while leaving the scale invariance intact. There are 
various multifractal generalizations of fBm, such as "multifractional Browning motion" [TO] 
and "fractional Brownian motion in multifractal time" [11]. For these processes the probability 
distribution of the increments may no longer be Gaussian and its shape can change as a 
function of the scaling parameter. Clearly Fig. (4) showing the kurtosis does not see evidence 
away from small n for this type of multifractal behavior. In addition these generalizations 
tend to lose the global scale invariance, in which case they are not appropriate for the b{i) 
series. 
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3 The Riemann hypothesis 



Here we focus on the scale invariance and its relation to the Riemann hypothesis. Assuming 
that this hypothesis is true then there is a relation between b{i) and the nontrivial zeros of 
the zeta function C('S)E 

bi^^.^^yMcMP^^ (9) 

where these zeros are of the form ^ + i'j. The more common version [4J of Qj has Pi replaced 
by X and b{i) replaced by the difference in ([T]), Li(x) — 7r(x)§| 

Now consider the case where the Riemann hypothesis is violated, which corresponds to 
zeros appearing off the \ + it line and in the strip between it and 1 + it. Such zeros must be 
symmetrically placed about the real axis and the \+ it line. The result of such a quartet of 
zeros at a distance d from the \ + it line and a distance T from the real axis is to add the 
following term (ignoring sub-leading terms) to the RHS of ([2]), 

2^sin(Tlog(p,)). (10) 

If such a quartet of zeros existed, then for large enough pi the new term in (fTO!) will eventually 
dominate the sum. This is well known as is the fact that there are no such zeros up to some 
very large value of T, currently of order 2.4 x 10^^ [13]. 

Nevertheless we would like to see how the scale invariance would be affected by such zeros. 
We therefore study the modified series. 



h{i) 



d 



^/Wil log(Pi) T 



+ 2^sin(Tlog(p,)) 



log (p. 



(11) 



Figs. (7) and (8) display the modified correlations and variances for d = .25 and T = 20000. 
A dramatic violation of scale invariance has occurred§| 

The violation causes a bifurcation in the correlations for some value of m; around this 
value the correlations at sufficiently large n are pushed away from their original values in 
opposite directions. By varying T we find that the value of m near the bifurcation is such 
that 2™ ^ T. This is an interesting connection between the number of primes in the samples 
where a bifurcation is detected, and the location of a d > zero of the zeta function. The 



^For ^ to become an equality the "1" should be replaced by a slowly varying function of that approaches 
1 from above as pi increases. 

^The re-weighted version of the b{i) series appearing on the LHS of ([9]) is central to other investigations 
[12j . and one may wonder whether this re- weighted series could be used in place of b{i) in the previous analysis. 
The scale invariance of the correlations would still persist, but the increments of the re- weighted series would 
have variances far from the constant of fBm. 

■^The kurtosis also deviates from its Gaussian value. 
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value of d affects the value of n where the bifurcation becomes visible; as d is decreased the 
departures from scale invariance move to larger n values. But for a fixed d and T there are 
m and n sufficiently large such that the correlations tend to unity. 
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Figure 7: Ci{m, n) — (16 — m)/10 for m = 16, 15, ...10 from top to bottom, showing a violation 
of the Riemann Hypothesis. 
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Figure 8: Variance[A(m, n)]/2" for m = 16, 15, ...10 from top to bottom, showing a violation 
of the Riemann Hypothesis. 
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We may estimate more generally the primes that would be needed to detect the broken 
scale invariance characterized by some zero with > at height T. The value of n needed to 
easily detect the distortion in the correlations is n l/dP. Thus nT^ ^ T/(P is the number 
of primes needed. The largest prime pi of this set would have i = 2^2"^ ^ 2'd^T. We note that 
Pi as a function of i has been calculated up to about 10^^ [IJ. If we use the corresponding i 
as an upper bound then this limits the sensitivity to a violation of the Riemann hypothesis. 
For example for d = .25 it would be possible to rule out such zeros with T as large as 10^^. 

We have shown that the existence of a single quartet of zeros that violate the Riemann 
hypothesis would show up as a breakdown of the scale invariance. There could be a whole 
spectrum, possibly infinite, of zeros away from the ^+it line. Although we do not prove it here, 
it appears that any such violation of the Riemann hypothesis would modify the correlations 
and/or the variances such that the scale invariance evident in Figs. (1-3) would not persist 
to arbitrarily high m and n. Conversely, the statement that scale invariance does persist to 
arbitrarily large m and n is a statement that is at least as strong as the Riemann hypothesis. 
If the preservation of scale invariance is indeed a stronger statement, then it would not only 
enforce all nontrivial zeros to lie on the line ^ + it, but it would also constrain the distribution 
of zeros along that line. Precisely what form this constraint takes is an open question. 

The distribution of zeros of the zeta function, and in particular the probability distribution 
of the spacings, has been a topic of intense study (e.g. see [13]). The most naive possibility, 
that there exists a scale invariance of correlations among zeros as we have found for the primes, 
does not appear to be the case. Correlations of nearest neighbor spacings at lag k and at high 
t were explored in [H] where it was found that C^'^™'^ ~ — 0.32/A;^. This behavior is quite 
different from the behavior for fBm, as given in ([6]), which thus suggests that the correlations 
are not scale invariant. A Hurst exponent was extracted [15j using a rescaled range analysis 
where it was found that H ^ 0.1 characterizes the distribution of zeros at high t. This scaling 
did not apply for zeros at low t and even for high t the scaling fit was poor. As mentioned 
below (Q the full x dependence of the function Li(x) —tt{x) is reproduced from the knowledge 
of the zeros, whereas the series b{i) only samples this function at a discrete set of values. Thus 
while the zeros appear to contain much more information, much of it may be superfluous as 
far as the scale invariance is concerned. 

In summary we have described the scale invariance of the negative correlations that exist 
in the distribution of prime numbers. The scale invariance {n independence) was separated 
from a nonstationarity (m dependence) and these results encompass both large and small 
primes. The prime numbers are providing a nontrivial and fundamental manifestation of 
scale invariance. We have described a lesser known version of fractional Brownian motion 
that enjoys a simple definition and that displays the same properties as the prime number 
series. But to obtain precise agreement, the prime numbers are suggesting some further scale 
invariant generalization of fBm. We have also argued that this structure is of mathematical 
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interest, since the scale invariance appears to imply the Riemann hypothesis. There may thus 
be motivation to directly test the scale invariance to much larger primes, and we estimated the 
number of primes needed to set competitive limits on violations of the Riemann hypothesis. 
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